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Description 

SPEECH AND MUSIC SIGNAL CODER/DECODER 

Technical Field 

The present invention relates to a coder and a decoder for 
transmitting a speech and music signal at a low bit rate. 

_ - Background of the Invention 

As a method for coding a speech signal at medium and low bit 
rates at a high efficiency, there is widely used a method for coding a 
speech signal by separating the speech signal into a linear prediction 
filter and a drive sound source signal (sound source signal) thereof. 

CELP (Code Excited Linear Prediction) is one of the 
representative methods. In CELP, a synthesized speech signal 
(reproduction signal) is generated by driving a linear prediction filter set 
with a linear prediction coefficient calculated by subjecting input speech 
to a linear prediction analysis by a sound source signal represented as a 
sum of a signal representative of a pitch period of speech and a noise- 
like signal. 

With regard to CELP, a description is given in M. Schroeder et al 
"Code excited lineaY^rediction: High quality speech at very low bit rates" 
(Proc. ICASSP, pp.937-940, 1985) (Reference 1). Further, a coding 
performance with regard to >Vmusic signal can be improved by 
constructing CELP, mentioned abbve^by a band division constitution. 
According to the constitution, a reproduction signal is generated by 
driving a linear prediction synthesis filter by an^excitation signal provided 
by adding sound source signals in correspondence^with respective 
bands. \ 
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With regard to CEIP having the band division constitution, a 
description is given ir^A. Ubale et al "Multi-band CELP Coding of Speech 
and Music" (IEEE Workshop on Speech Coding for Telecommunications, 
pp. 101 -102, 1997) (ReferencX2). 

Fig. 1 is a block diagram showing an example of a conventional 
speech and music signal coder. Here, for simplicity, a number of bands 
is set to 2. An input signal (input vector) generated by sampling speech 
or music signals and summarizing a plurality of the samples in one 
vector as one frame, is inputted from an input terminal 1 0. ~ 

A linear prediction coefficient calculating circuit 170 is inputted with 
the input vector from the input terminal 1 0. The linear prediction 
coefficient calculating circuit 170 carries out a linear prediction analysis 
with regard to the input vector and calculates a linear prediction 
coefficient. Further, the linear prediction coefficient calculating circuit 
1 70 quantizes the linear prediction coefficient and calculates a quantized 
linear prediction coefficient. The linear prediction coefficient is 
outputted to a weighting filter 140 and a weighting filter 141. An index 
in correspondence with the quantized linear prediction coefficient is 
outputted to a linear prediction synthesis filter 130, a linear prediction 
synthesis filter 131 and a code outputting circuit 190. 

A first sound source generating circuit 110 is inputted with an index 
outputted from a first minimizing circuit 150. The first sound source 
generating circuit 110 reads a first sound source vector in 
correspondence with the index from a table stored with a plurality of 
sound source vectors and outputs the first sound source vector to a first 
gain circuit 160. 

A second sound source generating circuit 111 is inputted with an 
index outputted from a second minimizing circuit 151. A second sound 
source vector in correspondence with the index is read from a table 
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stored with a plurality of sound source vectors and is outputted to a 
second gain circuit 161. 

The first gain circuit 160 is inputted with the index outputted from 
the first minimizing circuit 150 and the first sound source vector 
outputted from the first sound source generating circuit 110. The first 
gain circuit 160 reads a first gain in correspondence with the index from 
a table stored with a plurality of values of gains. Thereafter, the first 
gain circuit 160 multiplies the first gain by the first sound source vector 
and generates a third sound source vector and outputs the third sound 
source vector to a first band pass filter 120. 

The second gain circuit 161 is inputted with the index outputted 
from the second minimizing circuit 151 and the second sound source 
vector outputted from the second sound source generating circuit 111 . 
The second gain circuit 161 reads a second gain in correspondence with 
the index from a table stored with a plurality of values of gains. 
Thereafter, the second gain circuit 161 multiplies the second gain by the 
second sound source vector and generates a fourth sound source vector 
and outputs the fourth sound source vector to a second band pass filter 
121. 

The first band pass filter 120 is inputted with the third sound 
source vector outputted from the first gain circuit 160. A band of the 
third sound source vector is restricted to a first band by the filter to 
thereby generate a first excitation vector. The first band pass filter 120 
outputs the first excitation vector to the linear prediction synthesis filter 
130. 

The second band pass filter 121 is inputted with the fourth sound 
source vector outputted from the second gain circuit 161. A band of the 
fourth sound source vector is restricted to a second band by the filter to 
thereby generate a second excitation vector. The second band pass 
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filter 121 outputs the second excitation vector to the linear prediction 

synthesis filter 131. 

The linear prediction synthesis filter 130 is inputted with the first 

excitation vector outputted from the first band pass filter 120 and an 

index in correspondence with the quantized linear prediction coefficient 

outputted from the linear prediction coefficient calculating circuit 170. 

The linear prediction synthesis filter 130 reads the quantized linear 
_ prediction coefficient in correspondence with the index from a table 

stored with a plurality of the quantized linear prediction coefficients. By 
i 3 driving the filter set with the quantized linear prediction coefficient by the 

!; j first excitation vector, a first reproduction signal (reproduced vector) is 

J '* generated. The first reproduced vector is outputted to a first differencer 

!:H 180. 

m The linear prediction synthesis filter 131 is inputted with the 

i,3 second excitation vector outputted from the second band pass filter 121 

u and an index in correspondence with the quantized linear prediction 

j." coefficient outputted from the linear prediction coefficient calculating 

circuit 170. The linear prediction synthesis filter 131 reads the 
quantized linear prediction coefficient in correspondence with the index 
from a table stored with a plurality of quantized linear prediction 
coefficients. By driving the filter set with the quantized linear prediction 
coefficient by the second excitation vector, a second reproduced vector 
is generated. The second reproduced vector is outputted to a second 
differencer 181. 

The first differencer 180 is inputted with the input vector via the 
input terminal 10 and is inputted with the first reproduced vector 
outputted from the linear prediction synthesis filter 1 30. The first 
differencer 180 calculates a difference between the input vector and the 
first reproduced vector. The difference is outputted to the weighting 
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filter 140 and the second differencer 181 as a first difference vector. 

The second differencer 181 is inputted with the first difference 
vector from the first differencer 180 and is inputted with the second 
reproduced vector outputted from the linear prediction synthesis filter 
131. The second differencer 181 calculates a difference between the 
first difference vector and the second reproduced vector. The 
difference is outputted to the weighting filter 141 as a second difference 

- - vector, .. .. _ .. 

The weighting filter 140 is inputted with the first difference vector 
3 outputted from the first differencer 180 and the linear prediction 

;3 coefficient outputted from the linear prediction coefficient calculating 

t circuit 1 70. The weighting filter 140 generates a weighting filter in 

§ correspondence with an auditory characteristic of human being by using 

n the linear prediction coefficient and drives the above-described 

3 weighting filter by the first difference vector. By the above-described 

't operation of the weighting filter 140, a first weighted difference vector is 

^ generated. The first weighted difference vector is outputted to the first 

minimizing circuit 150. 

The weighting filter 141 is inputted with the second difference 
vector outputted from the second differencer 181 and the linear 
prediction coefficient outputted from the linear prediction coefficient 
calculating circuit 170. The weighting filter 141 generates a weighting 
filter in correspondence with the auditory characteristic of human being 
by using the linear prediction coefficient and drives the above-described 
weighting filter by the second difference vector. By the above- 
described operation of the weighting filter 141, a second weighted 
difference vector is generated. The second weighted difference vector 
is outputted to the second minimizing circuit 151 . 



The first minimizing circuit 150 successively outputs indexes in 
correspondence with all of the first sound source vectors stored in the 
first sound source generating circuit 110 to the first sound source 
generating circuit 110 and successively outputs indexes in 
correspondence with all of the first gains stored in the first gain circuit 
160 to the first gain circuit 160. Further, the first minimizing circuit 150 
is successively inputted with the first weighted difference vector 
- outputted from the weighting filter 140. The first minimizing circuit 1 50 
calculates a norm thereof. The first minimizing circuit 150 selects the ~ 
first sound source vector and the first gain to minimize the norm and 
outputs an index in correspondence with these to the code outputting 
circuit 190. 

The second minimizing circuit 151 successively outputs indexes in 
correspondence with all of the second sound source vectors stored in 
the second sound source generating circuit 111 to the second sound 
source generating circuit 111 and successively outputs indexes in 
correspondence with all of the second gains stored in the second gain 
circuit 161 to the second gain circuit 161 . Further, the second 
minimizing circuit 151 is successively inputted with the second weighted 
difference vector outputted from the weighting filter 141 . The second 
minimizing circuit 151 calculates a norm thereof. The second gain 
circuit 161 selects the second sound source vector and the second gain 
to minimize the norm and outputs an index in correspondence with these 
to the code outputting circuit 1 90. 

The code outputting circuit 190 is inputted with an index in 
correspondence with the quantized linear prediction coefficient outputted 
from the linear prediction coefficient calculating circuit 170, inputted with 
indexes outputted from the first minimizing circuit 150 in correspondence 
with respectives of the first sound source vector and the first gain and 



inputted with indexes outputted from the second minimizing circuit 151 in 
correspondence with respectives of the second sound source vector and 
the second gain. The code outputting circuit 190 converts the 
respective indexes into codes of bit series and outputs the respective 
indexes after conversion via an output terminal 20. 

Fig. 2 is a block diagram showing an example of a conventional 
speech and music signal decoding apparatus. A code inputting circuit 
310 is inputted with a code in a bit series from an input terminal 30. 

The code input circuit 310 converts the code in the bit series 
inputted from the input terminal 30 into indexes. An index in 
correspondence with a first sound source vector is outputted to a first 
sound source generating circuit 110. An index in correspondence with 
a second sound source vector is outputted to a second sound source 
generating circuit 111 . An index in correspondence with a first gain is 
outputted to a first gain circuit 160. An index in correspondence with a 
second gain is outputted to a second gain circuit 161 . An index in 
correspondence with a quantized linear prediction coefficient is 
outputted to a linear prediction synthesis filter 130 and a linear prediction 
synthesis filter 131. 

The first sound source generating circuit 110 is inputted with the 
index outputted from the code inputting circuit 310. The first sound 
source generating circuit 110 reads the first sound source vector in 
correspondence with the index from a table stored with a plurality of 
sound source vectors and outputs the sound source vector to the first 
gain circuit 160. 

The second sound source generating circuit 111 is inputted with 
the index outputted from the code inputting circuit 31 0. The second 
sound source generating circuit 111 reads the second sound source 
vector in correspondence with the index from a table stored with a 
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plurality of sound source vectors and outputs the second sound source 
vector to the second gain circuit 161 . 

The first gain circuit 160 is inputted with the index outputted from 
the code inputting circuit 310 and the first sound source vector outputted 
from the first sound source generating circuit 110. The first gain circuit 
160 reads a first gain in correspondence with the index from a table 
stored with a plurality of values of gains. The first gain circuit 160 
generates a third sound source vector by multiplying the first gain by the 
first sound source vector. The third sound source vector is outputted to 
a first band pass filter 1 20. 

The second gain circuit 161 is inputted with the index outputted 
from the code inputting circuit 310 and the second sound source vector 
outputted from the second sound source generating circuit 111 . The 
second gain circuit 161 reads a second gain in correspondence with the 
index from a table stored with a plurality of values of gains. Thereafter, 
the second gain circuit 161 generates a fourth sound source vector by 
multiplying the second gain by the second sound source vector. The 
fourth sound source vector is outputted to a second band pass filter 121 . 

The first band pass filter 120 is inputted with the third sound 
source vector outputted from the first gain circuit 160. A band of the third 
sound source vector is restricted to a first band by the filter and the third 
sound source vector generates a first excitation vector. The first band 
pass filter 120 outputs the first excitation vector to the linear prediction 
synthesis filter 130. 

The second band pass filter 121 is inputted with the fourth sound 
source vector outputted from the second gain circuit 161. A band of the 
fourth sound source vector is restricted to a second band by the filter 
and accordingly, the second band pass filter 121 generates a second 
excitation vector. The second band pass filter 121 outputs the second 
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excitation vector to the linear prediction synthesis filter 131 . 

The linear prediction synthesizing vector 130 is inputted with the 
first excitation vector outputted from the first band pass filter 120 and the 
index in correspondence with the quantized linear prediction coefficient 
outputted from the code inputting circuit 31 0. The quantized linear 
prediction coefficient in correspondence with the index is read from a 
table stored with a plurality of quantized linear prediction coefficients. 
Thereafter, the linear prediction synthesis filters 130 generates a first 
reproduced vector by driving the filter set with the quantized linear 
prediction coefficient by the first excitation vector. The first reproduced 
vector is outputted to an adder 182. 

The linear prediction synthesis filter 131 is inputted with the 
second excitation vector outputted from the second band pass filter 121 
and the index in correspondence with the quantized linear prediction 
coefficient outputted from the code inputting circuit 31 0. The quantized 
linear prediction coefficient in correspondence with the index is read 
from a table stored with a plurality of quantized linear prediction 
coefficients. The linear prediction synthesis filter 131 generates a 
second reproduced vector by driving the filter set with the quantized 
linear prediction coefficient by the second excitation vector. The 
second reproduced vector is outputted to the adder 182. 

The adder 182 is inputted with the first reproduced vector 
outputted from the linear prediction synthesis filter 130 and the second 
reproduced vector outputted from the linear prediction synthesis filter 
131 . A sum of these is calculated. The adder 182 outputs the sum of 
the first reproduced vector and the second reproduced vector as a third 
reproduced vector via an output terminal 40. 

According to the above-described conventional speech and music 
signal coder, there is constructed the constitution in which the 
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reproduction signal is generated by driving the linear prediction 
synthesis filters calculated from the input signal by the excitation signal 
provided by adding the excitation signal having a band characteristic in 
correspondence with a low region of the input signal and the excitation 
signal having a band characteristic in correspondence with a high region 
of the input signal and accordingly, a coding operation based on CELP is 
carried out in a band belonging to a high frequency region and 

accordingly, coding performance is deteriorated in the band belonging to — - 

the high frequency region and therefore, coding quality of the speech 
and music signal in all of bands is deteriorated. 
j3 The reason is that a signal in the band belonging to the high 

if; frequency region is provided with a property significantly different from 

CO speech and therefore, according to CELP modeling a procedure of 

m generating speech, the signal in the band belonging to the high 

f - 3 frequency region cannot be generated with a high accuracy. 

! Jf It is an object of the invention to provide a speech and music 

j'y signal coder capable of resolving the above-described problem and 

\ - M coding a speech and music signal over all of bands. 

Disclosure of the Invention 

An apparatus of coding a speech and music signal according to 
the invention (apparatus of the invention 1) generates a first 
reproduction signal by driving a linear prediction synthesis filter 
calculated from an input signal by an excitation signal in correspondence 
with a first band, generates a residual signal by driving an inverse filter of 
the linear prediction synthesis filter by a differential signal of the input 
signal and the first reproduction signal and codes a component in 
correspondence with a second band in the residual signal after 
subjecting the component to orthogonal transformation. 
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Specifically the apparatus of the invention 1 includes means (110, 
160, 120, 130 of Fig. 3) for generating a first reproduction signal by 
driving the linear prediction synthesis filter by the excitation signal in 
correspondence with the first band, means (180, 230 of Fig. 3) for 
generating a residual signal by driving an inverse filter of the linear 
prediction synthesis filter by a differential signal of the input signal and 
the first reproduction signal, and means (240, 250, 260 of Fig. 3) for 
coding a component in correspondence with the second band in the 
residual signal after subjecting the component to orthogonal 
transformation. 

An apparatus of coding of a speech and music signal according to 
the invention (apparatus of the invention 2) generates a first and a 
second reproduction signal by driving a linear prediction synthesis filter 
calculated from an input signal by excitation signals in correspondence 
with a first and a second band, generates a residual signal by driving an 
inverse filter of the linear prediction synthesis filter by a differential signal 
of a signal produced by adding the first and the second reproduction 
signals and the input signal and codes a component in correspondence 
with a third band in the residual signal after subjecting the component to 
orthogonal transformation. 

Specifically, the apparatus of the invention 2 includes means 
(1001 , 1002 of Fig. 10) for generating a first and a second reproduction 
signal by driving the linear prediction synthesis filter by the excitation 
signals in correspondence with a first one and a second one of the 
bands, and means (1003 of Fig. 10) for generating a residual signal by 
driving an inverse filter of the linear prediction synthesis filter by a 
differential signal of a signal produced by adding the first and the second 
reproduction signals and the input signal and coding a component in 
correspondence with a third one of the bands in the residual signal after 
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subjecting the component to orthogonal transformation. 

An apparatus of coding a speech and music signal according to 
the invention (apparatus of the invention 3) generates a first through an 
(N-1)-th reproduction signal by driving a linear prediction synthesis filter 
calculated from an input signal by excitation signals in correspondence 
with a first through an (N-1)-th band, generates a residual signal by 
driving an inverse filter of the linear prediction synthesis filter by a 
differential signal of a signal produced by adding a first through an (N- 
1)-th reproduction signal and the input signal and codes a component in 
correspondence with an N-th band in the residual signal after subjecting 
the component to orthogonal transformation. 

Specifically, the apparatus of the invention 3 includes means 
(1001 , 1004 of Fig. 11) for generating a first through an (N-1)-th 
reproduction signal by driving the linear prediction synthesis filter by 
excitation signals in correspondence with a first through an (N-1)-th 
band, and means (1005 of Fig. 11) for generating a residual signal by 
driving an inverse filter of the linear prediction synthesis filter by a 
differential signal of a signal produced by adding the first through the 
(N-1)-th reproduction signals and the input signal and coding a 
component in correspondence with an N-th band in the residual signal 
after subjecting the component to orthogonal transformation. 

An apparatus of coding a speech and music signal according to 
the invention (apparatus of the invention 4) generates, in second coding 
operation, a residual signal by driving an inverse filter of a linear 
prediction synthesis filter calculated from an input signal by a differential 
signal of a first coded decoding signal and the input signal and codes a 
component in correspondence with an arbitrary band in the residual 
signal after subjecting the component to orthogonal transformation. 
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Specifically, the apparatus of the invention 4 includes means (180 
of Fig. 13) for calculating a difference of a first coded decoding signal 
and the input signal and means (1002 of Fig. 13) for generating a 
residual signal by driving an inverse filter of the linear prediction 
synthesis filter calculated from the input signal by the differential signal 
and coding a component in correspondence with an arbitrary one of the 
bands in the residual signal after subjecting the component to 
orthogonal transformation. 

An apparatus of coding a speech and music signal according to 
the invention (apparatus of the invention 5) generates, in third coding 
operation, a residual signal by driving an inverse filter of a linear 
prediction synthesis filter calculated from an input signal by a differential 
signal of a signal produced by adding a first and a second coded 
decoding signal and the input signal and codes a component in 
correspondence with an arbitrary band in the residual signal after 
subjecting the component to orthogonal transformation. 

Specifically, the apparatus of the invention 5 includes means 
(1801 , 1802 of Fig. 14) for calculating a differential signal of a signal 
produced by adding a first and a second coded decoding signal and the 
input signal, and means (1003 of Fig. 14) for generating a residual signal 
by driving an inverse filter of the linear prediction synthesis filter 
calculated from the input signal by the differential signal and coding a 
component in correspondence with an arbitrary band in the residual 
signal after subjecting the component to orthogonal transformation. 

An apparatus of coding a speech and music signal according to 
the invention (apparatus of the invention 6) generates, in N-th coding 
operation, a residual signal by driving an inverse filter of a linear 
prediction synthesis filter calculated from an input signal by a differential 
signal of a signal produced by adding a first through an (N-1)-th coded 



14 



decoding signal and the input signal and codes a component in 
correspondence with an arbitrary band in the residual signal after 
subjecting the component to orthogonal transformation. 

Specifically, the apparatus of the invention 6 includes means 
(1 801 , 1 802 of Fig. 1 5) for calculating a differential signal of a signal 
produced by adding a first through an (N-1)-th coded decoding signals 
and the input signal, and means (1005 of Fig. 15) for generating a 
residua! signal by driving an inverse filter of the linear prediction 
synthesis filter calculated from the input signal by the differential signal 
and coding a component in correspondence with an arbitrary band in the 
residual signal after subjecting the component to orthogonal 
transformation. 

An apparatus of coding a speech and music signal according to 
the invention (apparatus of the invention 7) uses a pitch prediction filter 
in generating an excitation signal in correspondence with a first band of 
an input signal. Specifically, the apparatus of the invention 7 includes 
pitch predicting means (112, 162, 184, 510 of Fig. 16). 

An apparatus of coding a speech and music signal according to 
the invention (apparatus of the invention 8) generates a second input 
signal by down-sampling a first input signal sampled at a first sampling 
frequency to a second sampling frequency, generates a first 
reproduction signal by driving a synthesis filter set with a first linear 
prediction coefficient calculated from the second input signal by an 
excitation signal, generates a second reproduction signal by up- 
sampling the first reproduction signal to a first sampling frequency, 
further, calculates a third linear prediction coefficient from a difference of 
a linear prediction coefficient calculated from the first input signal and a 
second linear prediction coefficient provided by subjecting the first linear 
prediction coefficient to the first sampling frequency by sampling 
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frequency conversion, calculates a fourth linear prediction coefficient 
from a sum of the second linear prediction coefficient and the third linear 
prediction coefficient, generates a residual signal by driving an inverse 
filter set with the fourth linear prediction coefficient by a differential signal 
of the first input signal and the second reproduction signal and codes a 
component in correspondence with an arbitrary band in the residual 
signal after subjecting the component to orthogonal transformation. 

Specifically, the apparatus of the invention 8 includes means (780 
of Fig. 1 7) for generating a second input signal by down-sampling a first 
input signal sampled at a first sampling frequency to a second sampling 
frequency, means (770, 132 of Fig. 17) for generating a first reproduction 
signal by driving a synthesis filter set with a first linear prediction 
coefficient calculated from the second input signal by an excitation signal, 
means (781 of Fig. 17) for generating a second reproduction signal by 
up-sampling the first reproduction signal to the first sampling frequency, 
means (771 , 772 of Fig. 17) for calculating a third linear prediction 
coefficient from a difference of a linear prediction coefficient calculated 
from the first input signal, the first linear prediction coefficient and a 
second linear prediction coefficient provided by converting a sampling 
frequency to the first sampling frequency, means (180, 730 of Fig. 17) for 
calculating a fourth linear prediction coefficient from a sum of the second 
linear prediction coefficient and the third linear prediction coefficient and 
generating a residual signal by driving an inverse filter set with the fourth 
linear prediction coefficient by a differential signal of the first input signal 
and the second reproduction signal and means (240, 250, 260 of Fig. 
17) for coding a component in correspondence with an arbitrary band in 
the residual signal after subjecting the component to orthogonal 
transformation. 
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An apparatus of decoding a speech and music signal according to 
the invention (apparatus of the invention 9) generates an excitation 
signal in correspondence with a second band by subjecting a decoded 
orthogonal transformation coefficient to orthogonal inverse 
transformation, generates a second reproduction signal by driving a 
linear prediction synthesis filter by the excitation signal, further, 
generates a first reproduction signal by driving the linear prediction filter 
by an excitation signal in correspondence with a decoded first band and 
generates decoded speech and music by adding the first reproduction 
signal and the second reproduction signal. 

Specifically, the apparatus of the invention 9 includes means (440, 
460 of Fig. 18) for generating the excitation signal in correspondence 
with the second band by subjecting a decoding signal and an orthogonal 
transformation coefficient to orthogonal inverse transformation, means 
(131 of Fig. 18) for generating a second reproduction signal by driving 
the linear prediction synthesis filter by the excitation signal, means (110, 
120, 130, 160 of Fig. 18) for generating a first reproduction signal by 
driving the linear prediction filter by the excitation signal in 
correspondence with the first band, and means (182 of Fig. 18) for 
generating decoded speech and music by adding the first reproduction 
signal and the second reproduction signal. 

An apparatus of decoding a speech and music signal according to 
the invention (apparatus of the invention 10) generates an excitation 
signal in correspondence with a third band by subjecting a decoded 
orthogonal transformation coefficient to orthogonal inverse 
transformation, generates a third reproduction signal by driving a linear 
prediction synthesis filter by the excitation signal, further, generates a 
first and a second reproduction signal by driving the linear prediction 
filter by excitation signals in correspondence with decoded first and 
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second bands and generates decoded speech and music signal by 
adding the first through the third reproduction signals. 

Specifically, the apparatus of the invention 10 includes means 
(1053 of Fig. 24) for generating the excitation signal in correspondence 
with the third band by subjecting a decoded orthogonal transformation 
coefficient to orthogonal inverse transformation and generating a third 
reproduction signal by driving the linear prediction synthesis filter by the 
excitation signal, means (1051 , 1052 of Fig. 24) for generating a first and 
a second reproduction signal by driving the linear prediction filter by the 
excitation signals in correspondence with the first and the second bands, 
and means (1821, 1822 of Fig. 24) for generating decoded speech and 
music by adding the first through the third reproduction signals. 

An apparatus of decoding a vocal music signal according to the 
invention (apparatus of the invention 11) generates an excitation signal 
in correspondence with an N-th band by subjecting a decoded 
orthogonal transformation coefficient to orthogonal inverse 
transformation, generates an N-th reproduction signal by driving a linear 
prediction synthesis filter by the excitation signal, further, generates a 
first through an (N-1)-th reproduction signal by driving the linear 
prediction filter by excitation signals in correspondence with decoded 
first through (N-1)-th band and generates decoded vocal music by 
adding the first through the N-th reproduction signals. 

Specifically, the apparatus of the invention 11 includes means 
(1055 of Fig. 25) for generating an excitation signal in correspondence 
with the N-th band by subjecting a decoded orthogonal transformation 
coefficient to orthogonal inverse transformation and generating an N-th 
reproduction signal by driving the linear prediction synthesis filter by the 
excitation signal, means (1051 , 1054 of Fig. 25) for generating a first 
through an (N-1)-th reproduction signal by driving the linear prediction 
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filter by the excitation signals in correspondence with the first through 
the (N-1)-th bands, and means (1821, 1822 of Fig. 25) for generating 
decoded vocal music by adding the first through the N-th reproduction 
signals. 

An apparatus of decoding a vocal music signal according to the 
invention (apparatus of the invention 12) generates, in second decoding 
operation, an excitation signal by subjecting a decoded orthogonal 

. transformation coefficient.to orthogonal inverse transformation, 

generates a reproduction signal by driving a linear prediction synthesis 
filter by the excitation signal and generates decoded vocal music by 
adding the reproduction signal and the first decoded signal. 

Specifically, the apparatus of the invention 12 includes means 
(1052 of Fig. 26) for generating an excitation signal by subjecting a 
decoded orthogonal transformation coefficient to orthogonal inverse 
transformation and generating a reproduction signal by driving a linear 
prediction synthesis filter by the excitation signal, and means (182 of Fig. 
26) for generating decoded vocal music by adding the reproduction 
signal and a first decoding signal. 

An apparatus of decoding a vocal music signal according to the 
invention (apparatus of the invention 13) generates, in third decoding 
operation, an excitation signal by subjecting a decoded orthogonal 
transformation coefficient to orthogonal inverse transformation, 
generates a reproduction signal by driving a linear prediction synthesis 
filter by the excitation signal and generates decoded vocal music by 
adding the reproduction signal and a first and a second decoding signal. 

Specifically, the apparatus of the invention 13 includes means 
(1053 of Fig. 27) for generating the excitation signal by subjecting a 
decoded orthogonal transformation coefficient to orthogonal inverse 
transformation and generating a reproduction signal by driving the linear 
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prediction synthesis filter by the excitation signal, and decoded vocal 
music generating means (1821 , 1822 of Fig. 27) for generating decoded 
vocal music by adding the reproduction signal and a first and a second 
decoding signal. 

An apparatus of decoding a vocal music signal according to the 
invention (apparatus of the invention 14) generates, in N-th decoding 
operation, an excitation signal by subjecting a decoded orthogonal 
transformation coefficient to orthogonal inverse transformation and __. 
generates a reproduction signal by driving a linear prediction synthesis 
filter by the excitation signal and generates decoded vocal music by 
adding the reproduction signal and a first through an (N-1)-th decoding 
signal. 

Specifically, the apparatus of the invention 14 includes means 
(1055 of Fig. 28) for generating the excitation signal by subjecting a 
decoded orthogonal transformation coefficient to orthogonal inverse 
transformation and generating a reproduction signal by driving the linear 
prediction synthesis filter by the excitation signal, and means (1821, 
1822 of Fig. 28) for generating decoded vocal music by adding the 
reproduction signal and a first through an (N-1)-th decoding signal. 

An apparatus of decoding a vocal music signal according to the 
invention (apparatus of the invention 15) uses a pitch prediction filter in 
generating an excitation signal in correspondence with a first band. 
Specifically, the apparatus of the invention 15 further includes pitch 
predicting means (112, 162, 184, 510 of Fig. 29). 

An apparatus of decoding a vocal music signal according to the 
invention (apparatus of the invention 16) generates a first reproduction 
signal by up-sampling a signal provided by driving a first linear prediction 
synthesis filter by a first excitation signal in correspondence with a first 
band to a first sampling frequency, generates a second excitation signal 
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in correspondence with a second band by subjecting a decoded 
orthogonal transformation coefficient to orthogonal inverse 
transformation, generates a second reproduction signal by driving a 
second linear prediction synthesis filter by the second excitation signal 
and generates decoded vocal music by adding the first reproduction 
signal and the second reproduction signal. 

Specifically, the apparatus of the invention 16 includes means 
(132, 781 of Fig. 30) for generating a first reproduction signal by up- 
sampling a signal provided by driving a first linear prediction synthesis 
filter by a first excitation signal in correspondence with a first band to a 
first sampling frequency, means (440, 831 of Fig. 30) for generating a 
second excitation signal in correspondence with a second band by 
subjecting a decoded orthogonal transformation coefficient to orthogonal 
inverse transformation and generating a second reproduction signal by 
driving a second linear prediction synthesis filter by the second 
excitation signal, and means (182 of Fig. 30) for generating decoded 
vocal music by adding the first reproduction signal and the second 
reproduction signal. 

An apparatus of decoding a code of a vocal music signal 
according to the invention (apparatus of the invention 17) decodes a 
code outputted from the apparatus of the invention 1 by the apparatus of 
the invention 9. Specifically, the apparatus of the invention 17 includes 
the vocal music signal coding means (Fig. 3) and the vocal music signal 
decoding means (Fig. 18). 

An apparatus of decoding a code of a vocal music signal 
according to the invention (apparatus of the invention 18) decodes a 
code outputted from the apparatus of the invention 2 by the apparatus of 
the invention 10. Specifically, the apparatus of the invention 18 
includes the vocal music signal coding means (Fig. 10) and the vocal 
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music signal decoding means (Fig. 24). 

An apparatus of decoding a code of a vocal music signal 
according to the invention (apparatus of the invention 1 9) decodes a 
code outputted from the apparatus of the invention 3 by the apparatus of 
the invention 11 . Specifically, the apparatus of the invention 19 
includes the vocal music signal coding means (Fig. 11) and the vocal 
music signal decoding means (Fig. 25). 

An apparatus of decoding a code of a vocal music signal 
according to the invention (apparatus of the invention 20) decodes a 
code outputted from the apparatus of the invention 4 by the apparatus of 

0 the invention 12. Specifically, the apparatus of the invention 20 

il includes the vocal music signal coding means (Fig. 13) and the vocal 

m music signal decoding means (Fig. 26). 

j| An apparatus of decoding a code of a vocal music signal 

^ according to the invention (apparatus of the invention 21) decodes a 

1 U code outputted from the apparatus of the invention 5 by the apparatus of 
f y the invention 13. Specifically, the apparatus of the invention 21 

il includes the vocal music signal coding means (Fig. 14) and the vocal 

music signal decoding means (Fig. 27). 

An apparatus of decoding a code of a vocal music signal 
according to the invention (apparatus of the invention 22) decodes a 
code outputted from the apparatus of the invention 6 by the apparatus of 
the invention 14. Specifically, the apparatus of the invention 22 
includes the vocal music signal coding means (Fig. 15) and the vocal 
music signal decoding means (Fig. 28). 

An apparatus of decoding a code of a vocal music signal 
according to the invention (apparatus of the invention 23) decodes a 
code outputted from the apparatus of the invention 7 by the apparatus of 
the invention 15. Specifically, the apparatus of the invention 23 
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includes the vocal music signal coding means (Fig. 16) and the vocal 
music signal decoding means (Fig. 29). 

An apparatus of decoding a code of a vocal music signal 
according to the invention (apparatus of the invention 24) decodes a 
code outputted from the apparatus of the invention 8 by the apparatus of 
the invention 16. Specifically, the apparatus of the invention 24 
includes the vocal music signal coding means (Fig. 17) and the vocal 
music signal decoding means (Fig. 30). 

According to the invention, a first reproduction signal is generated 
by driving a linear prediction synthesis filter calculated from an input 
signal by an excitation signal having a band characteristic in 
correspondence with a low region of the input signal, generates a 
residual signal by driving an inverse filter of the linear prediction 
synthesis filter by a differential signal of the input signal and the first 
reproduction signal and codes a high region component of the residual 
signal by using a coding system based on orthogonal transformation. 
That is, with regard to a signal having a property different from that of 
speech in a band belonging to a high frequency region, there is carried 
out coding operation based on orthogonal transformation in place of 
CELP. According to the coding operation based on the orthogonal 
transformation, coding performance with respect to a signal having 
property different from that of speech is higher than that of CELP. 
Therefore, the coding performance with regard to a high region 
component of the input signal is improved. As a result, a vocal music 
signal can excellently be coded over all of bands. 

Brief Description of the Drawings 

Fig. 1 is a block diagram showing an embodiment of a speech and 
music signal coder according to a conventional method. 
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Fig. 2 is a block diagram showing an embodiment of a vocal music 
signal decoding apparatus according to a conventional method. 

Fig. 3 is a block diagram showing a constitution of a speech and 
music signal coder according to a first embodiment of the invention. 

Fig. 4 is a block diagram showing a constitution of a first sound 
source generating circuit 110. 

Fig. 5 is a view for explaining a method of generating a subvector 

— _ in a band selecting circuit 250. _.. . 

Fig. 6 is a block diagram showing a constitution of an orthogonal 
transformation coefficient quantizing circuit 260. 
3 Fig. 7 is a block diagram equivalent to Fig. 3 showing the 

1 2 constitution of the speech and music signal coder according to the first 

* n embodiment of the invention. 

!.„ Fig. 8 is a block diagram showing a constitution of a first coding 

^ circuit 1001 in Fig. 5. 

! LI Fig. 9 is a block diagram showing a constitution of a second 

I: A 

IU coding circuit 1002 in Fig. 5. 

j!l Fig. 10 is a block diagram showing a constitution of a speech and 

music signal coder according to a second embodiment of the invention. 

Fig. 11 is a block diagram showing a constitution of a speech and 
music signal coder according to a third embodiment of the invention. 

Fig. 12 is a block diagram showing a constitution of a first coding 
circuit 1011 in Fig. 31 . 

Fig. 13 is a block diagram showing a constitution of a speech and 
music signal coder according to a fourth embodiment of the invention. 

Fig. 14 is a block diagram showing a constitution of a speech and 
music signal coder according to a fifth embodiment of the invention. 

Fig. 15 is a block diagram showing a constitution of a speech and 
music signal coder according to a sixth embodiment of the invention. 
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Fig. 16 is a block diagram showing a constitution of a speech and 
music signal coder according to a seventh embodiment of the invention. 

Fig. 17 is a block diagram showing a constitution of a speech and 
music signal coder according to an eighth embodiment of the invention. 

Fig. 18 is a block diagram showing a constitution of a vocal music 
signal decoding apparatus according to a ninth embodiment of the 
invention. 

Fig. J 9 is a view for explaining a method of generating a second 
excitation vector in an orthogonal transformation coefficient inversely 
quantizing circuit 460. 

Fig. 20 is a block diagram showing a constitution of the orthogonal 
transformation coefficient inversely quantizing circuit 460. 

Fig. 21 is a block diagram equivalent to Fig. 36 showing the 
constitution of the vocal music signal decoding apparatus according to 
the ninth embodiment of the invention. 

Fig. 22 is a block diagram showing a constitution of a first 
decoding circuit 1051 in Fig. 39. 

Fig. 23 is a block diagram showing a constitution of a second 
decoding circuit 1052 in Fig. 39. 

Fig. 24 is a block diagram showing a constitution of a vocal music 
signal decoding apparatus according to a tenth embodiment of the 
invention. 

Fig. 25 is a block diagram showing a constitution of a vocal music 
signal decoding apparatus according to an eleventh embodiment of the 
invention. 

Fig. 26 is a block diagram showing a constitution of a vocal music 
signal decoding apparatus according to a twelfth embodiment of the 
invention. 
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Fig. 27 is a block diagram showing a constitution of a vocal music 
signal decoding apparatus according to a thirteenth embodiment of the 
invention. 

Fig. 28 is a block diagram showing a constitution of a vocal music 
signal decoding apparatus according to a fourteenth embodiment of the 
invention. 

Fig. 29 is a block diagram showing a constitution of a vocal music 
signal decoding apparatus according to a fifteenth embodiment of the 
invention. 

Fig. 30 is a block diagram showing a constitution of a vocal music 
signal decoding apparatus according to a sixteenth embodiment of the 
invention. 

Fig. 31 is a view for explaining a correspondence between an 
index and a code of a bit series in a code outputting circuit 290. 

Fig. 32 is a view for explaining a method of generating a first pitch 
vector in a pitch signal generating circuit 112. 

Best Mode for Carrying Out the Invention 

Fig. 3 is a block diagram showing a constitution of a speech and 
music signal coder according to a first embodiment of the invention. 
Here, an explanation will be given thereof with a number of bands as 2. 
An input signal (input vector) generated by sampling a speech or music 
signal and summarizing a plurality of the samples in one vector as one 
frame, is inputted from an input terminal 10. The input vector is 
represented as x(n), n=0, L-1. Incidentally, notation L designates a 
vector length. A band of the input signal is restricted to FsO [Hz] 
through FeO [Hz]. For example, a sampling frequency is set to 16 [kHz] 
and FsO and FeO are set as Fs0=50 [Hz] and Fe0=7000 [Hz]. 
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A linear prediction coefficient calculating circuit 170 inputs the 
input vectonfrom the input terminal 10, carries out linear prediction 
analysis withVegard to the input vector, calculates linear prediction 

coefficients ai i=1 Np, further, quantizes the linear prediction 

coefficients and calculates quantized linear prediction coefficients ai', 
i=1 , Np. Hfere, notation Np designates a linear prediction degree, for 
example, 16. Further, the linear prediction coefficient calculating circuit 
170 outputs the linear prediction coefficients to a weighting filter 140 and 
outputs indexes ^correspondence with the quantized linear prediction 
coefficients to a linear prediction synthesis filter 130, a linear prediction 
inverse filter 230 ano\a code outputting circuit 290. With regard to 
quantization of the linear prediction coefficient, there is, for example, a 
method of converting the linear prediction coefficient to a line spectrum 
pair (LSP) and quantizing the converted linear prediction coefficient. 
With regard to conversionVof the linear prediction coefficient into LSP, a 
description is given by Sugamura et al "Speech information compression 
by a linear spectrum pair (LSP) speech analyzing and synthesizing 
system" (Proceeding of Electronic, Information and Communication 
Society A, VoU64-A, No.8, pd599-606, 1981) (Reference 3). With 
regard to quantization of LSP, a description is given by Omuro et al 
"Vector quantization of an LSP parameter by using moving average type 
interframe prediction" (Proceeding of Electronic, Information and 
Communication Society A, Vol.J77\A, No.3, pp.303-312, 1994) 
(Reference 4). \ 

A first sound source generating circuit 110 inputs an index 
outputted from a first minimizing circuit 1 50. A first sound source vector 
in correspondence with the index is read from a table stored with a 
plurality of sound source signals (sound source vectors) and is outputted 
to a first gain circuit 160. Here, a description will be given of a 
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constitution of the first sound source generating circuit 110 in reference 
to Fig. 4. A table 1101 provided by the first sound source generating 
circuit 110 is stored with Ne pieces of sound source vectors. For 
example, Ne is 256. A switch 1102 is inputted with an index "i" 
outputted from the first minimizing circuit 150 via an input terminal 1103. 
The switch 1102 selects a sound source vector in correspondence with 
the index from the table and outputs the sound source vector as a first 
sound source vector to the first gain circuit 160 via an output terminal 
1104. 

Further, with regard to coding of a sound source signal, there can 
be used^ method of efficiently expressing a sound source signal by a 
multiple puJse signal comprising a plurality of pulses and prescribed by 
positions of Vie pulses and amplitudes of the pulses. With regard to 
coding of a sound source signal using a multiple pulse signal, a 
description is given by Ozawa et al "MP-CELP speech codification 
based on a multiple pulse spectra quantized sound source and high 
speed search" (Proceeding of Electronic, Information and 
Communication Society A, pp. 1655-1 663, 1 996) (Reference 5). By the 
above-described, an explanation of the first sound source generating 
circuit 110 is finished. \ 

Returning to the explanation of Fig. 3, the first gain circuit 160 is 
provided with a table stored with values of gains. The first gain circuit 
160 is inputted with the index outputted from the first minimizing circuit 
150 and the first sound source vector outputted from the first sound 
source generating circuit 110. A first gain in correspondence with the 
index is read from the table and the first gain is multiplied by the first 
sound source vector to thereby form a second sound source vector. 
The generated second sound source vector is outputted to a first band 
pass filter 120. 
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The first band pass filter 120 is inputted with the second sound 
vector outputted from the first gain circuit 1 60. A band of the second 
sound source vector is restricted to a first band by this filter to thereby 
provide a first excitation vector. The first band pass filter 120 outputs 
the first excitation vector to the linear prediction synthesis filter 130. 
Here, the first band is set to Fs1 [Hz] through Fe1 [Hz]. Incidentally, 
Fs0<Fs1<Fe1<Fe0. For example, Fs1=50 [Hz], Fe1=4000 [Hz]. 
Further, the first band pass filter 120 is provided with a characteristic of 
restricting a band to the first band and can also be realized by a higher 
degree linear prediction filter 1/B(z) characterized in having a linear 
prediction degree of about 100 degree. In this case, when notation 
Nph designates a linear prediction degree and the linear prediction 
coefficient is pi, i=1, Nph, a transfer function 1/B(z) of the higher 
degree linear prediction filter is represented by Equation (1) as follows. 
With regard to the higher degree linear prediction filter, a description is 
given in Reference 2, mentioned above. 

Equation (1) 

Nph 

1/B(z) = l/(1- ZAz') 
i=i 

The linear prediction synthesis filter 130 is provided with a table 
stored with quantized linear prediction coefficients. The linear 
prediction synthesis filter 130 is inputted with the first excitation vector 
outputted from the first band pass filter 120 and an index in 
correspondence with the quantized linear prediction coefficient outputted 
from the linear prediction coefficient calculating circuit 170. Further, the 
linear prediction synthesis filter 130 reads the quantized linear prediction 
coefficient in correspondence with the index from the table. By driving 
a synthesis filter 1/A(z) set with the quantized linear prediction coefficient 
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by the first excitation vector, a first reproduction signal (reproduced 
vector) is generated. The first reproduced vector is outputted to a first 
differencer 180. In this case, a transfer function 1/A(z) of the synthesis 
filter is expressed by Equation (2) as follows. 
Equation (2) 

n p 

l/A(z) = l/(l-£a 1 V) 

i=l 

The first differencer 180 is inputted with the input vector via the 

input terminal 10 and the first reproduced vector outputted from the 
linear prediction synthesizing vector 130. The first differencer 180 
calculates a difference therebetween and outputs a difference value 
thereof as a first difference vector to the weighting filter 140 and the 
linear prediction inverse filter 230. 

The first weighting filter 140 is inputted with the first difference 
vector outputted from the first differencer 180 and the linear prediction 
coefficient outputted from the linear prediction coefficient calculating 
circuit 1 70. The first weighting filter 140 generates a weighting filter 
W(z) in correspondence with an auditory characteristic of a human being 
by using the linear prediction coefficient and drives the weighting filter by 
the first difference vector. Thereby, a first weighted difference vector is 
provided. Further, the first weighted difference vector is outputted to 
the first minimizing circuit 150. In this case, a transfer function W(z) of 
the weighting filter is expressed as W(z)=Q(z/y1)/Q(z/y2). Incidentally, 
Q(z/y1) is expressed by Equation (3) as follows. y1 and y2 are 
constants and, for example, y1=0.9, y2=0.6. Further, with regard to 
details of the weighting filter, a description is given in Reference 1 , 
mentioned above. 
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Equation (3) 

n p n p . . 

Q(z/n) = 1 - 2>i>Vz^ Q (7.1 y 2 ) = 1 - S^rz'z' 

i=l i=l 

The first minimizing circuit 150 successively outputs indexes in 
correspondence with all of the first sound source vectors stored in the 
first sound source generating circuit 110 to the first sound source 
generating circuit 110 and successively outputs indexes in 
correspondence with all of the first gains stored in the first gain circuit 
160 to the first gain circuit 160. Further, the first minimizing circuit 150 
receives the first weighted difference vectors successively outputted 
from the weighting filter 140, calculates a norm thereof, selects the first 
sound source vector and the first gain minimizing the norm and outputs 
an index in correspondence therewith to the code outputting circuit 290. 

The linear prediction inverse filter 230 is provided with a table 
stored with quantized linear prediction coefficients. The linear 
prediction inverse filter 230 is inputted with the index in correspondence 
with the quantized linear prediction coefficient outputted from the linear 
prediction coefficient calculating circuit 170 and the first difference vector 
outputted from the first differencer 1 80. Further, the linear prediction 
inverse filter 230 reads a quantized linear prediction coefficient in 
correspondence with the index from the table. By driving an inverse 
filter A(z) set with the quantized linear prediction coefficient by the first 
difference vector, a first residue vector is provided. Further, the first 
residue vector is outputted to an orthogonal transformation circuit 240. 
A transfer function A(z) of the inverse filter is expressed by Equation (4) 
as follows. 

Equation (4) 
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a(z) = i - 2>;z' 

i = l 

The orthogonal transformation circuit 240 is inputted with the first 
residue vector outputted from the linear prediction inverse filter 230. 
The orthogonal transformation circuit 240 subjects the first residue 
vector to orthogonal transformation and generates a second residue 
vector. The second residue vector is outputted to a band selecting 
circuit 250. Here, as the orthogonal transformation, discrete cosine 
transform (DCT) can be used. 

The band selecting circuit 250 is inputted with the second residue 
vector outputted from the orthogonal transformation circuit 240. As 
shown by Fig. 3, in the second residue vector, there are generated Nsbv 
pieces of subvectors using components included in a second band. 
Although an arbitrary band can be set as the second band, in this case, 
the second band is constituted by a band of Fs2 [Hz] through Fe2 [Hz]. 
Incidentally, FsO<Fs2<Fe2<FeO. IN this case, the first band and the 
second band do not overlap, that is, Fe1<Fs2. For example, Fs2=4000 
[Hz], Fe2=7000 [Hz]. The band selecting circuit 250 outputs Nsbv 
pieces of the subvectors to an orthogonal transformation coefficient 
quantizing circuit 260. 

The orthogonal transformation coefficient quantizing circuit 260 is 
inputted with Nsvb pieces of the subvectors outputted from the band 
selecting circuit 250. The orthogonal transformation coefficient 
quantizing circuit 260 is provided with a table stored with quantized 
values (shape code vectors) in correspondence with shapes of the 
subvectors and a table stored with quantized values (quantization gains) 
in correspondence with gains of the subvectors. Quantization errors 
are minimized with regard to respectives of Nsbv pieces of the inputted 
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subvectors. The orthogonal transformation coefficient quantizing circuit 
260 selects the quantized values of the shapes and the quantized values 
of the gains from the tables and outputs corresponding indexes to the 
code outputting circuit 290. 

Here, a supplementary explanation will be given of a constitution 
of the orthogonal transformation coefficient quantizing circuit 260 in 
reference to Fig. 4. In Fig. 4, there are Nsbv pieces of blocks 
surrounded by dotted lines. In the respective blocks, Nsbv pieces of 
the subvectors are quantized. Nsbv pieces of the subvectors are 
expressed by Equation (5) as follows. 

Equation (5) 

e sb,0 ( n X ' ' * > e sb, N sbv -i (n), n = 0, • ■ • , L - 1 

Processing with regard to the respective subvectors is common. 
An explanation will be given of a processing with regard to e sb,0 (n), 
n=0, .... L-1. 

Subvectors e sb.O (n), n=0, .... L-1 are inputted via an input 
terminal 2650. A table 261 0 is stored with Nc,0 pieces of shape code 

vectors c0[j] (n), n=0 L-1, j=0, .... Nc,0-1. Here, notation L 

designates a vector length and notation "j" designates an index. The 
table 2610 inputs indexes outputted from a minimizing circuit 2630 and 
outputs the shape code vectors c0[j] (n), n=0, .... L-1 in correspondence 
with the indexes to a gain circuit 2620. A table provided by the gain 
circuit 2620 is stored with Ng,0 pieces of quantization gains g0[k], 
k=0, .... Ng,0-1. Here, notation "k" designates an index. 

The gain circuit 2620 is inputted with the shape code vectors c0[j] 
(n), n=0, .... L-1 outputted from the table 2610 and is inputted with the 
indexes outputted from the minimizing circuit 2630. The quantization 
gain g0[k] in correspondence with index is read from the table. 
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Quantized subvectors e' sb.O (n), n=0, .... L-1 provided by multiplying the 
quantization gains gO[k] by the shape code vectors cO[j] (n), n=0, .... L-1 
are outputted to a differencer 2640. The differencer 2640 calculates 
differences between the subvectors e sb,0 (n), n=0, .... L-1 inputted via 

an input terminal 2650 and the quantized subvectors e' sb,0 (n), n=0 

L-1 inputted from the gain circuit 2620. Difference values thereof are 
outputted to the minimizing circuit 2630 as difference vectors. The 
minimizing circuit 2630 successively outputs indexes in correspondence 
with all of the shape code vectors c0[j], (n), n=0, .... L-1 and j=0, 
Nc.0-1 stored in the table 261 0 to the table 261 0. Indexes in 
correspondence with all of the quantization gains g0[k] k=0, .... Ng,0-1 
stored in the gain circuit 2620 are successively outputted to the gain 
circuit 2620. Further, the difference vectors are successively inputted 
from the differencer 2640 and norms DO thereof are calculated. The 
minimizing circuit 2630 selects the shape code vectors c0[j] (n), n=0, 
L-1 and the quantization gains g0[k] minimizing the norms DO. Indexes 
in correspondence therewith are outputted to an index outputting circuit 
2660. Similar processing is carried out with respect to subvectors 
shown by Equation (6) as follows. 
Equation (6) 

e s b,i( n V--> e s b,N sbv -i( n )> n = 0,---,L-l 

The index outputting circuit 2660 is inputted with Nsbv pieces of 
the indexes outputted from the minimizing circuit. A set of the indexes 
summarizing these are outputted to the code outputting circuit 290 via 
an output terminal 2670. Further, with regard to determination of the 
shape code vectors c0[j] (n), n=0, L-1 and the quantization gains 
g0[k] minimizing the norm DO, the following method can also be used. 
The norm DO is expressed by Equation (7) as follows. 
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Equation (7) 

L-l 2 
r>=0 

= ^lesbflM - SO**' • co^in) ) 2 , 
n=0 

_J = 0,_- f N Cf0 -1, k = 0, Ng /0 - i 

Here, when an optimum gain g' 0 is set as shown by Equation (8) 
as follows, the norm DO can be modified as shown by Equation (8) or 
Equation (9) as follows. 

Equation (8) 

L-l r , 

go ' = n=0 i> _ 1 , j = 0, ... , N Cr0 - 1 

n = 0 

Equation (9) 

L-l 



n=0 

D 0 = Z J e sb r 0^ 



( Z e sb,0M ■ <=Q^(n) ) 2 



n=0 L J^c 0 b\n) 2 

n = 0 



j = 0, ••• , N Cr Q 



- 1 



Therefore, calculation of c0[j] (n), n=0, .... L-1, j=0, .... Nc,0-1 

minimizing DO, is equivalent to calculation of cOjj] (n), n=0 L-1, j=0, 

Nc,0-1 maximizing a second term of an equation shown by above 
Equation (9). Hence, after calculating cOG] (n), n=0, L-1 , j=j opt 
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maximizing the second term of the equation shown by above Equation 
(9), gO[k], k=k opt minimizing an equation shown by above Equation (7) 

is calculated with respect to cO[j] (n), n=0 L-1 , j=j opt. Here, as cOU] 

(n), n=0, L-1, j=j opt, a plurality of candidates are selected 
successively from larger values of the second term of the equation 
shown by above Equation (9). gO[k], k=k opt minimizing the equation 
shown by above Equation (7) is calculated for respectives thereof. cO[j] 
(n), n=0, L-1 , j=j opt and gO[k], k=k opt minimizing the norm DO can 
also be selected finally from these. A similar method is applicable to 
subvectors shown by Equation (10) as follows. 
Equation (10) 

e sb,l^-'VN !bv -.^- n = () '-' L - 1 

By the above-described, an explanation of the orthogonal 
transformation coefficient quantizing circuit 260 in reference to Fig. 4 is 
finished. In the following, the explanation in reference to Fig. 3 will be 
given again. 

The code outputting circuit 290 is inputted with indexes in 
correspondence with the quantized linear prediction coefficients 
outputted from the linear prediction coefficient calculating circuit 170. 
Further, the code outputting circuit 290 is inputted with indexes outputted 
from the first minimizing circuit 150 and in correspondence with 
respectives of the first sound source vectors and the first gains. 
Further, the code outputting circuit 290 is inputted with a set of indexes 
outputted from the orthogonal transformation coefficient quantizing 
circuit 260 and constituted by indexes of the shape code vectors and the 
quantization gains with respect to Nsbv pieces of subvectors. Further, 
as schematically shown by Fig. 31 , the respective indexes are converted 
into codes of bit series and are outputted via an output terminal 20. 
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Although the first embodiment explained in reference to Fig. 3 
shows the case in which the number of bands is 2, an explanation will be 
given of a case in which the number of bands is expanded to 3 or more 
as follows. 

Fig. 3 can be rewritten as shown by fig. 7. Here, a first coding 
circuit 1001 of Fig. 7 is equivalent to Fig. 8. A second coding circuit 
1 002 of Fig. 7 is equivalent to Fig. 9. Respective blocks constituting Fig. 
8 and Fig. 9 are the same as respective blocks explained in Fig. 3. 

The second embodiment according to the invention is realized by 
expanding the number of bands to 3 in the first embodiment. A 
constitution of a speech and music signal coder according to the second 
embodiment can be represented by a block diagram shown in Fig. 10. 
In the drawing, the first coding circuit 1001 is equivalent to Fig. 8, the 
second coding circuit 1002 is equivalent to Fig. 8 and the third coding 
circuit 1003 is equivalent to Fig. 9. A code outputting circuit 2901 is 
inputted with an index outputted from the linear prediction coefficient 
calculating circuit 170, inputted with an index outputted from the first 
coding circuit 1001 , inputted with an index outputted from the second 
coding circuit 1002 and inputted with a set of indexes outputted from the 
third coding circuit 1003. The respective indexes are converted into 
codes of bit series and outputted via the input terminal 20. 

A third embodiment of the invention is realized by expanding the 
number of bands to N in the first embodiment. A constitution of a 
speech and music signal coder according to the third embodiment can 
be represented by a block diagram shown in Fig. 11 . Here, the first 
coding circuit 1001 through an (N-1)-th coding circuit 1004 are 
equivalent to Fig. 8. An N-th coding circuit 1005 is equivalent to Fig. 9. 
A code outputting circuit 2902 is inputted with an index outputted from 
the linear prediction coefficient calculating circuit 170, inputted with 
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indexes outputted from respectives of the first coding circuit 1001 
through the (N-1)-th coding circuit 1004 and inputted with a set of 
indexes outputted from the N-th coding circuit 1005. Further, the 
respective indexes are converted into codes of bit series and outputted 
via the output terminal 20. 

According to the first embodiment, the first coding circuit 1001 
shown in Fig. 7 is based on a coding system using an A-b-S (Analysis- 
by-Synthesis) method. However, according to the first embodiment, a 
"" ~ coding system other than the A-b-S method is also applicable to the first 
coding circuit 1001 . In the following, an explanation will be given of a 
O case in which a coding system using time frequency conversion is 

hi applied to the first coding circuit 1001 as a coding system other than the 

Ji A-b-S method. 

J| A fourth embodiment of the invention is realized by applying the 

coding system using time frequency conversion in the first embodiment. 
I U A constitution of a speech and music signal coder according to the fourth 

! y embodiment of the invention can be represented by a block diagram 

K shown in Fig. 1 3. In this case, a first coding circuit 1011 is equivalent to 

Fig. 12. A second coding circuit 1002 is equivalent to Fig. 9. Among 
blocks constituting Fig. 12, the linear prediction inverse filter 230, the 
orthogonal transformation circuit 240, the band selecting circuit 250 and 
the orthogonal transformation coefficient quantizing circuit 260 are the 
same as the respective blocks explained in Fig. 3. Further, an 
orthogonal transformation coefficient inverse quantizing circuit 460 and 
an orthogonal inverse transformation circuit 440 and the linear prediction 
synthesis filter 131 are the same as blocks constituting a vocal music 
decoding apparatus in correspondence with the first embodiment by a 
ninth embodiment, mentioned later. 
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An explanation of the orthogonal transformation coefficient inverse 
quantizing circuit 460, the orthogonal inverse transformation circuit 440 
and the linear prediction synthesis filter 131 will be omitted here since an 
explanation thereof will be given in the ninth embodiment in reference to 
Fig. 15. A code outputting circuit 2903 is inputted with an index 
outputted from the linear prediction coefficient calculating circuit 170, 
inputted with a set of indexes outputted from the first coding circuit 1011 
and inputted with a set of indexes outputted from the second coding 
circuit 1002. Further, the respective indexes are converted into codes 
of bit series and outputted via the output terminal 20. 

A fifth embodiment of the invention is realized by expanding a 
number of bands to 3 in the fourth embodiment. A constitution of a 
speech and music signal coder according to the fifth embodiment of the 
invention can be represented by a block diagram shown in Fig. 14. In 
this case, the first coding circuit 1011 is equivalent to Fig. 12, a second 
coding circuit 1012 is equivalent to Fig. 12 and the third coding circuit 
1003 is equivalent to Fig. 9. A code outputting circuit 2904 is inputted 
with an index outputted from the linear prediction coefficient calculating 
circuit 170, inputted with a set of indexes outputted from the first coding 
circuit 1011 , inputted with a set of indexes outputted from the second 
coding circuit 1012 and inputted with a set of indexes outputted from the 
third coding circuit 1003. The respective indexes are converted into 
codes of bit series and outputted via the output terminal 20. 

A sixth embodiment of the invention is realized by expanding the 
number of bands to N in the fourth embodiment. A constitution of a 
speech and music signal coder according to the sixth embodiment of the 
invention can be represented by a block diagram shown in Fig. 15. In 
this case, respectives of the first coding circuit 1011 through an (N-1)-th 
coding circuit 1014 are equivalent to Fig. 12. An N-th coding circuit 
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1 005 is equivalent to Fig. 9. A code outputting circuit 2905 is inputted 
with an index outputted from the linear prediction coefficient calculating 
circuit 170, inputted with sets of indexes outputted from respectives of 
the first coding circuit 1011 through the (N-1)-th coding circuit 1014 and 
inputted with a set of indexes outputted from the N-th coding circuit 1005. 
Further, the respective indexes are converted into codes of bit series 
and outputted via the output terminal 20. 

Fig. 16 is a block diagram showing a constitution of a speech and 
music signal coder according to a seventh embodiment of the invention. 
A block surrounded by dotted lines in the drawing is referred to as a pitch 
prediction filter. Fig. 16 is provided by adding the pitch prediction filter 
to Fig. 3. In the following, an explanation will be given of a storing 
circuit 510, a pitch signal generating circuit 112, a third gain circuit 162, 
an adder 184, a first minimizing circuit 550 and a code outputting circuit 
590 which are blocks different from those in Fig. 3. 

The storing circuit 510 inputs a fifth sound source signal from the 
adder 184 and holds the fifth sound source signal. The storing circuit 
510 outputs the fifth sound source signal which has been inputted in the 
past and held to the pitch signal generating circuit 112. 

The pitch signal generating circuit 112 is inputted with the past fifth 
sound source signal held in the storing circuit 510 and an index 
outputted from the first minimizing circuit 550. The index designates a 
delay "d". Further, as shown in Fig. 32, in the past fifth sound source 
signal, a first pitch vector is generated by cutting out a signal of L sample 
in correspondence with a vector length from a point which is past from a 
start point of a current frame by d sample. In this case, in the case of 
d<L, a signal of d sample is cut out, the cut-out d sample is repeatedly 
connected and the first pitch vector having the vector length of L sample 
is generated. The pitch signal generating circuit 112 outputs the first 
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pitch vector to the third gain circuit 162. 

The third gain circuit 162 is provided with a table stored with 
values of gains. The third gain circuit 162 is inputted with an index 
outputted from the first minimizing circuit 550 and the first pitch vector 
outputted from the pitch signal generating circuit 112. A third gain in 
correspondence with the index is read from the table, the third gain is 
multiplied by the first pitch vector to thereby form a second pitch vector 
and the generated second pitch vector is outputted to the adder 184. 

The adder 184 is inputted with the second sound source vector 
outputted from the first gain circuit 160 and the second pitch vector 
outputted from the third gain circuit 1 62. The adder 1 84 calculates a 
sum of the second sound source vector and the second pitch vector, 
constitutes a fifth sound source vector by the value and outputs the 
sound source vector to the first band pass filter 120. 

In the first minimizing circuit 550, indexes in correspondence with 
all of the first sound source vectors stored in the first sound source 
generating vector 110 are successively outputted to the first sound 
source generating circuit 110. Indexes in correspondence with all of 
the delays "d" in a range prescribed in the pitch signal generating circuit 
112, are successively outputted to the pitch signal generating circuit 112. 
Indexes in correspondence with all of the first gains stored in the first 
gain circuit 160 are successively outputted to the first gain circuit 160. 
Indexes in correspondence with all of third gains stored in the third gain 
circuit 162 are successively outputted to the third gain circuit 162. 
Further, the first minimizing circuit 550 successively inputs the first 
weighted difference vectors outputted from the weighting filter 140 and 
calculates the norm. The first minimizing circuit 550 selects the first 
sound source vector, the delay "d", the first gain and the third gain 
minimizing the norm, summarizes indexes in correspondence therewith 
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and outputs the indexes to the code outputting circuit 590. 

The code outputting circuit 590 is inputted with an index in 
correspondence with the quantized linear prediction coefficient outputted 
from the linear prediction coefficient calculating circuit 170. The code 
outputting circuit 590 is inputted with the indexes outputted from the first 
minimizing circuit 550 and in correspondence with respectives of the first 
sound source vector, the delay "d", the first gain and the third gain. The 
code outputting circuit 590 is inputted with a set of indexes outputted 
from the orthogonal transformation coefficient quantizing circuit 260 and 
constituted by indexes of shape code vectors and quantization gains in 
correspondence with Nsbv pieces of subvectors. Further, the 
respective indexes are converted into codes in bit series and outputted 
via the output terminal 20. 

Fig. 17 is a block diagram showing a constitution of a speech and 
music signal coder according to an eighth embodiment of the invention. 
In the following, an explanation will be given of a down-sampling circuit 
780, a first linear prediction coefficient calculating circuit 770, a first 
linear prediction synthesis filter 132, a third differencer 183, an up- 
sampling circuit 781, a first differencer 180, a second linear prediction 
coefficient calculating circuit 771 , a third linear prediction coefficient 
calculating circuit 772, a linear prediction inverse filter 730 and a code 
outputting circuit 790 which are blocks different from those in Fig. 16. 

The down-sampling circuit 780 receives an input vector from the 
input terminal to and outputs a second input vector provided by down- 
sampling the inpihsyector and having a first band to the first linear 
prediction coefficienVcalculating circuit 770 and the third differencer 183. 
Here, the first band is set to Fs1 [Hz] through Fe1 [Hz] similar to the first 
embodiment and a band ofsthe input vector is set to FsO [Hz] through 
FeO [Hz] (third band). With regard to a constitution of the down- 
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sampling circuit, a description is given to paragraph 4.1.1 of a Reference 
(Reference 6)\tled as "Multirate Systems and Filter Banks" by P.P. 
Vaidyanathan. \ 

The first linear prediction coefficient calculating circuit 770 
receives the second input vector from the down-sampling circuit 780, 
carries out linear prediction analysis with regard to the second input 
vector, calculates a first linear prediction coefficient having the first band, 
further, quantizes the first linear prediction coefficient and calculates a 
first quantized linear prediction coefficient. The first linear prediction 
coefficient calculating circuit 770 outputs the first linear prediction 
coefficient to the first weighting filter 140 and outputs an index in 
correspondence with the first quantized linear prediction coefficient to 
the first linear prediction synthesis filter 132, the linear prediction inverse 
filter 730 and the third linear prediction coefficient calculating circuit 772 
and the code outputting circuit 790. 

The first linear prediction synthesis filter 132 is provided with a 
table stored with first quantized linear prediction coefficients. The first 
linear prediction synthesis filter 132 is inputted with the fifth sound 
source vector outputted from the adder 184 and the index in 
correspondence with the first quantized linear prediction coefficient 
outputted from the first linear prediction coefficient calculating circuit 770. 
Further, the first linear prediction synthesis filter 132 reads a first 
quantized linear prediction coefficient in correspondence with the index 
from the table and drives the synthesis filter set with the first quantized 
linear prediction coefficient by the fifth sound source vector to thereby 
form a first reproduced vector having the first band. Further, the first 
reproduced vector is outputted to the third differencer 183 and the up- 
sampling circuit 781. 
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The third differencer 1 83 receives the first reproduced vector 
outputted from the first linear prediction synthesis filter 132 and the 
second input vector outputted from the down-sampling circuit 780, 
calculates a difference therebetween and outputs the difference as a 
second difference vector to the weighting filter 140. 

The up-sampling circuit 781 receives the first reproduced vector 
outputted fromythe first linear prediction synthesis filter 132, upsamples 
the first reproduced vector and generates a third reproduced vector 
having a third banV In this case, the third band falls in a range of FsO 
[Hz] through FeO [HzK The up-sampling circuit 781 outputs the third 
reproduced vector to theN^rst differencer 180. With regard to a 
constitution of the up-samplihg circuit, a description is given to 
paragraph 4.1.1 of the reference\Reference 6) titled as "Multirate 
systems and Filter Banks" by P.P. N^yanathan. 

The first differencer 180 receives the input vector via the input 
terminal 10 and the third reproduced vector outputted from the up- 
sampling circuit 781, calculates a difference therebetween and outputs 
the difference as a first difference vector to the linear prediction inverse 
filter 730. 

The second linear prediction coefficient calculating circuit 771 
receives the input vector from the input terminal 10, carries out linear 
prediction analysis with respect to the input vector, calculates a second 
linear prediction coefficient having the third band and outputs the second 
linear prediction coefficient to the third linear prediction coefficient 
calculating circuit 772. 

Trfe third linear prediction coefficient calculating circuit 772 is 
provided with a table stored with first quantized linear prediction 
coefficients. The third linear prediction coefficient calculating circuit 
772 is inputted wrtfothe second linear prediction coefficient outputted 
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from the second linear prediction coefficient calculating circuit 771 and 
the index in correspondence with the first quantized linear prediction 
coefficient outputted from the first linear prediction coefficient calculating 
circuit 770. TheViird linear prediction coefficient calculating circuit 772 
reads a first quantised linear prediction coefficient in correspondence 
with the index from tt^e table, converts the first quantized linear 
prediction coefficient itoto LSP, further and subjects LSP to sampling 
frequency conversion tathereby form first LSP in correspondence with a 
sampling frequency of the input signal. Further, the third linear 
prediction coefficient calculating circuit 772 converts the second linear 
prediction coefficient into LSP and generates a second LSP. The third 
linear prediction coefficient calculating circuit 772 calculates a difference 
between second LSP and first LSP. A difference value thereof is 
defined as third LSP. Here, With regard to the sampling frequency 
conversion of LSP, a description^ given to Japanese Patent Application 
No. 202475/1997 (Reference 7). Vhe third LSP is quantized and the 
quantized third LSP is converted into^a linear prediction coefficient and a 
third quantized linear prediction coefficient having the third band is 
generated. Further, the index in correspondence with the third 
quantized linear prediction coefficient is outoutted to the linear prediction 
inverse filter 730 and the code outputting circuit 790. 

The linear prediction inverse filter 730 is provided with a first table 
stored with first quantized linear prediction coefficients and a second 
table stored with third quantized linear prediction coefficients. The 
linear prediction inverse filter 730 is inputted with a first index in 
correspondence with the first quantized linear prediction coefficient 
outputted from the first linear prediction coefficient calculating circuit 770 
and a second index in correspondence with the third quantized linear 
prediction coefficient outputted from the third linear prediction coefficient 
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calculating circuit 772 and the first difference vector outputted from the 
first differencer 180. The linear prediction inverse filter 730 reads a first 
quantized linear prediction coefficient in correspondence with the first 
index from the first table, converts the first quantized linear prediction 
coefficient into LSP, further, subjects LSP to sampling frequency 
conversion to thereby generate first LSP in correspondence with the 
sampling frequency of the input signal. Further, the third quantized 
linear prediction coefficient in correspondence with the second index is 
read from the second table and converted into LSP to thereby generate 
third LSP. Next, the first LSP and the third LSP are added together to 
*.3 thereby generaate second LSP. The linear prediction inverse filter 730 

\,1 converts the second LSP into a linear prediction coefficient and 

i;q generates a second quantized linear prediction coefficient. The linear 

fi prediction inverse filter 730 generates a first residue vector by driving the 

^ inverse filter set with the second quantized linear prediction coefficient 

; U by the first difference vector. The first residue vector is outputted to the 

!'LJ orthogonal transformation circuit 240. 

il The code outputting circuit 790 is inputted with the index in 

correspondence with the first quantized linear prediction coefficient 
outputted from the first linear prediction coefficient calculating circuit 770, 
the index in correspondence with the third quantized linear prediction 
coefficient outputted from the third linear prediction coefficient 
calculating circuit 772, the index outputted from the first minimizing 
circuit 550 and in correspondence with respectives of the first sound 
source vector, the delay "d", the first gain and the third gain and the set 
of indexes outputted from the orthogonal transformation coefficient 
quantizing circuit 260 and constituted by indexes of the shape code 
vectors and the quantization gains in correspondence with Nsbv pieces 
of the subvectors. The respective indexes are converted into codes in 
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bit series and outputted via the output terminal 20. 

Fig. 18 is a block diagram showing a constitution of a vocal music 
signal decoding apparatus in correspondence with the first embodiment 
by the ninth embodiment of the invention. The decoding apparatus is 
inputted with codes in bit series from the input terminal 30. 

A code inputting circuit 410 converts codes in bit series inputted 
from the input terminal 30 into indexes. An index in correspondence 
with the first sound source vector is outputted to the first sound source 
generating circuit 110. An index in correspondence with the first gain is 
outputted to the first gain circuit 160. An index in correspondence with 
the quantized linear prediction coefficient is outputted to the linear 
prediction synthesis filter 130 and the linear prediction synthesis filter 
131. A set of indexes summarizing indexes in correspondence with 
respectives of the shape code vectors and the quantized gains with 
regard to the subvectors for Nsbv pieces of the subvectors is outputted 
to the orthogonal transformation coefficient inverse quantizing circuit 
460. 

The first sound source generating circuit 110 receives the index 
outputted from the code inputting circuit 410, reads the first sound 
source vector in correspondence with the index from a table stored with 
a plurality of sound source vectors and outputs the first sound source 
vector to the first gain circuit 160. 

The first gain circuit 160 is provided with a table stored with 
quantized gains. The first gain circuit 160 receives the index outputted 
from the code inputting circuit 410 and the first sound source vector 
outputted from the first sound source generating circuit 110, reads the 
first gain in correspondence with the index from the table, multiplies the 
first gain by the first sound source vector and generates the second 
sound source vector. The generated second sound source vector is 
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outputted to the first band pass filter 120. 

The first band pass filter 120 is inputted with the second sound 
source vector outputted from the first gain circuit 160. The band of the 
second sound source vector is restricted to the first band by the filter to 
thereby generate the first excitation vector. The first band pass filter 
120 outputs the first excitation vector to the linear prediction synthesis 
filter 130. 

An explanation will be given of a constitution of the orthogonal 
transformation coefficient inverse quantizing circuit 460 in reference to 
Fig. 20. In Fig. 20, there are Nsbv pieces of blocks surrounded by 
dotted lines. Nsbv pieces of quantized subvectors prescribed at the 
band selecting circuit 250 of Fig. 3 by the respective blocks, are 
represented by Equation (11) as follows. Nsbv pieces of the quantized 
subvectors are decoded. 

Equation (11) 

A decoding processing with regard to the respective quantized 
subvectors is common. In the following, an explanation will be given of 
a processing with respect to e' sb,0 (n), n=0, L-1 . Similar to the 
processing at the orthogonal transformation coefficient quantizing circuit 

260 in Fig. 3, the quantized subvectors e' sb.O (n), n=0 L-1 is 

represented by a product of the shape code vector c0[j] (n) N=0, L-1 
and the quantization gain g0[k]. Here, notations "j" and "k" represent 
indexes. An index inputting circuit 4630 inputs a set "if of indexes 
constituted by indexes of the shape code vectors and the quantization 
gains with regard to Nsbv pieces of the quantized subvectors outputted 
from the code inputting circuit 410 via an input terminal 4650. Further, 
from the set "if of indexes, an index i sbs.O designating the shape code 
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vector cO[j] (n), n=0, .... L-1 and an index i sbg.O designating the 
quantization gain gO[k], are taken out, i sbs.O is outputted to a table 4610 
and i sbg.O is outputted to a gain circuit 4620. The table 461 0 is stored 

with c0[j] (n), n=0, .... L-1 , j=0 Nc,0-1 . The table 4610 inputs the 

index i sbs.O outputted from the index inputting circuit 4630 and outputs 
the shape code vector cOR] (n), n=0, L-1 , j=i sbs.O in correspondence 
with i sbs.O to the gain circuit 4620. A table provided to the gain circuit 
4620 is stored with g0[k], k=0, .... Ng.0-1. The gain circuit 4620 
receives c0[j] (n), n=0, .... L-1 , j=i sbs.O outputted from the table 4610 
and the index i sbg.O outputted from the index inputting circuit 4630, 
reads the quantization gain g0[k], k=i sbg.O in correspondence with i 
sbg.O from the table and outputs the quantized subvector e' sb.O (n), 

n=0, .... L-1 provided by multiplying c0[j] (n) n=0 L-1, j=i sbg.O by 

gO[k], k=i sbg.O to an all band vector generating circuit 4640. The all 
band vector generating circuit 4640 is inputted with the quantized 
subvectors e' sb.O (n), n=0, .... L-1 outputted from the gain circuit 4620. 
Further, the all band vector generating circuit 4640 is inputted with 
vectors provided by a processing similar to that of e' sb,0 (n), n=0, .... L- 
1 and represented by Equation (12) as follows. 
Equation (12) 

e ; l ,, 1 ( n >'-' e 'sb,N sbv - 1 ( n )' n = o '-' L - 1 

As shown by Fig. 19, by arranging Nsbv pieces of the quantized 
subvectors (Equation (11)) in the second band prescribed by the band 
selecting circuit 250 in Fig. 3 and arranging null vector to other than the 
second band, the second excitation vector in correspondence with all of 
the bands (for example, when sampling frequency of reproduction signal 
is 16 kHz, 8 kHz band), is generated and the second excitation vector is 
outputted to the orthogonal inverse transformation circuit 440 via an 
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output terminal 4660. 

The orthogonal inverse transformation circuit 440 receives the 
second excitation vector outputted from the orthogonal transformation 
coefficient inverse quantizing circuit 460 and subjects the second 
excitation vector to orthogonal inverse transformation to thereby provide 
the third excitation vector. Further, the third excitation vector is 
outputted to the linear prediction synthesis filter 131 . In this case, as 
orthogonal inverse transformation, inverse discrete cosine transform 
(IDCT) can be used. 

The linear prediction synthesis filter 130 is provided with a table 
stored with quantized linear prediction coefficients. The linear 
prediction synthesized filter 130 is inputted with the first excitation vector 
outputted from the first band pass filter 120 and the index in 
correspondence with the quantized linear prediction coefficient outputted 
from the code inputting circuit 410. Further, the linear prediction 
synthesis filter 130 reads the quantized linear prediction coefficient in 
correspondence with the index from the table and generates the first 
reproduced vector by driving the synthesized filter 1/A(z) set with the 
quantized linear prediction coefficient by the first excitation vector. 
Further, the first reproduced vector is outputted to the adder 182. 

The linear prediction synthesis filter 131 is provided with a table 
stored with quantized linear prediction coefficients. The linear 
prediction synthesis filter 131 is inputted with the third excitation vector 
outputted from the orthogonal inverse transformation circuit 440 and the 
index in correspondence with the quantized linear prediction coefficient 
outputted from the code inputting circuit 410. Further, the linear 
prediction synthesis filter 131 reads the quantized linear prediction 
coefficient in correspondence with the index from the table and 
generates the second reproduced vector by driving the synthesis filter 
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1/A(z) set with the quantized linear prediction coefficient by the third 
excitation vector. The second reproduced vector is outputted to the 
adder 182. 

The adder 182 receives the first reproduced vector outputted from 
the linear prediction synthesized filter 130 and the second reproduced 
vector outputted from the linear prediction synthesis filter 131 , calculates 
a sum of these and outputs the sum as the third reproduced vector via 
the output terminal 40. 

Although the ninth embodiment explained in reference to Fig. 18 
shows the case in which the number of bands is 2, in the following, an 
explanation will be given of a case in which the number of bands is 
expanded to 3 or more. 

Fig. 18 can be rewritten as shown by Fig. 21 . In this case, a first 
decoding circuit 1051 of Fig. 21 is equivalent to Fig. 22, a second 
decoding circuit 1052 of Fig. 21 is equivalent to Fig. 23 and the 
respective blocks constituting Fig. 22 and Fig. 23 are the same as 
respective blocks explained in reference to Fig. 18. 

A tenth embodiment of the invention is realized by expanding the 
number of bands to 3 in the ninth embodiment. A constitution of a vocal 
music signal decoding apparatus according to the tenth embodiment of 
the invention can be represented by a block diagram shown in Fig. 24. 
In this case, the first decoding circuit 1051 is equivalent to Fig. 22, the 
second decoding circuit 1052 is equivalent to Fig. 22 and a third 
decoding circuit 1053 is equivalent to Fig. 23. The code input circuit 
4101 converts codes in bit series inputted from the input terminal 30 into 
indexes, outputs an index in correspondence with the quantized linear 
prediction coefficient to the first decoding circuit 1051 , the second 
decoding circuit 1052 and the third decoding circuit 1053, outputs 
indexes in correspondence with sound source vectors and gains to the 
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first decoding circuit 1051 and the second decoding circuit 1052 and 
outputs a set of indexes in correspondence with the shape code vectors 
and the quantization gains with regard to the subvectors to the third 
decoding circuit 1053. 

An eleventh element of the invention is realized by expanding a 
number of bands to N in the ninth embodiment. A constitution of a 
vocal music signal decoding apparatus according to the eleventh 
embodiment of the invention can be represented by a block diagram 
shown in Fig. 25. In this case, respectives of the first decoding circuit 
1051 through an (N-1)-th decoding circuit 1054 are equivalent to Fig. 22 
and an N-th decoding circuit 1055 is equivalent to Fig. 23. The code 
inputting circuit 4102 converts codes in bit series inputted from the input 
terminal 30 into indexes, outputs an index in correspondence with the 
quantized linear prediction coefficient to respectives of the first decoding 
circuit 1051 through the (N-1)-th decoding circuit 1054 and the N-th 
decoding circuit 1055, outputs indexes in correspondence with the 
sound source vectors and the gains to respectives of the first decoding 
circuit 1051 through the (N-1)-th decoding circuit 1054 and outputs a set 
of indexes in correspondence with the shape code vectors and the 
quantization gains of the subvectors to the N-th decoding circuit 1055. 

Although according to the ninth embodiment, the first decoding 
circuit 1051 in Fig. 21 is based on a decoding system in correspondence 
with the coding system using the A-b-S method, a decoding system in 
correspondence with a coding system other than the A-b-S method is 
applicable also to the first decoding circuit 1051 . In the following, an 
explanation will be given of a case in which a decoding system in 
correspondence with the coding system using time frequency 
conversion is applied to the first decoding circuit 1051 . 
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A twelfth embodiment of the invention is realized by applying the 
decoding system in correspondence with the coding system using time 
frequency conversion in the ninth embodiment. A constitution of a 
vocal music signal decoding apparatus according to the twelfth 
embodiment of the invention can be represented by a block diagram 
shown in Fig. 26. In the drawing, a first decoding circuit 1061 is 
equivalent to Fig. 23 and the second decoding circuit 1052 is equivalent 
to Fig. 23. A code inputting circuit 4103 converts codes in bit series 
inputted from the input terminal 30 into indexes, outputs an index in 
correspondence with the quantized linear prediction coefficient to the 
first decoding circuit 1061 and the second decoding circuit 1052 and 
outputs a set of indexes in correspondence with the shape code vectors 
and the quantization gains with regard to the subvectors to the first 
decoding circuit 1061 and the second decoding circuit 1052. 

A thirteenth embodiment of the invention is realized by expanding 
the number of bands to 3 in the twelfth embodiment. A constitution of a 
vocal music signal decoding apparatus according to the thirteenth 
embodiment of the invention can be represented by a block diagram 
shown in Fig. 27. In this case, the first decoding circuit 1061 is 
equivalent to Fig. 23, the second decoding circuit 1062 is equivalent to 
Fig. 23 and a third decoding circuit 1053 is equivalent to Fig. 23. The 
code inputting circuit 4104 converts codes in bit series inputted from the 
input terminal 30 into indexes, outputs an index in correspondence with 
the quantized linear prediction coefficient to the first decoding circuit 
1061, the second decoding circuit 1062 and the third decoding circuit 
1053 and outputs a set of indexes in correspondence with the shape 
code vectors and the quantization gains with regard to the subvectors to 
the first decoding circuit 1061, the second decoding circuit 1062 and the 
third decoding circuit 1053. 
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A fourteenth embodiment of the invention is realized by expanding 
the number of bands to N in the twelfth embodiment. A constitution of a 
vocal music signal decoding apparatus according to the fourteenth 
embodiment of the invention can be represented by a block diagram 
shown in Fig. 28. In this case, respectives of the first decoding circuit 
1061 through an (N-1)-th decoding circuit 1064 are equivalent to Fig. 23 
and an N-th decoding circuit 1055 is equivalent to Fig. 23. A code 
inputting circuit 4105 converts codes in bit series inputted from the input 
terminal 30 into indexes, outputs an index in correspondence with the 
quantized linear prediction coefficient to respectives of the first decoding 
circuit 1061 through the (N-1)-th decoding circuit 1064 and the N-th 
decoding circuit 1055 and outputs a set of indexes in correspondence 
with the shape code vectors and the quantization gains with regard to 
the subvectors to respectives of the first decoding circuit 1061 through 
the (N-1)-th decoding circuit 1064 and the N-th decoding circuit 1055. 

Fig. 29 is a block diagram showing a constitution of a vocal music 
signal decoding apparatus in correspondence with the seventh 
embodiment according to a fifteenth embodiment of the invention. In 
Fig. 29, blocks different from those in the ninth embodiment in Fig. 18 
are the storing circuit 510, the pitch signal generating circuit 112, the 
third gain circuit 162, the adder 184 and a code inputting circuit 610, 
however, the storing circuit 510, the pitch signal generating circuit 112, 
the third gain circuit 162 and the adder 184 are similar to those in Fig. 16 
and accordingly, an explanation thereof will be omitted and an 
explanation will be given of the coding inputting circuit 610. 

The code inputting circuit 610 converts codes in bit series inputted 
from the input terminal 30 into indexes. An index in correspondence 
with the first sound source vector is outputted to the first sound source 
generating circuit 110. An index in correspondence with the delay "d" is 



54 



outputted to the pitch signal generating circuit 112. An index in 
correspondence with the first gain is outputted to the first gain circuit 160. 
An index in correspondence with the third gain is outputted to the third 
gain circuit 162. An index in correspondence with the quantized linear 
prediction coefficient is outputted to the linear prediction synthesis filter 
1 30 and the linear prediction synthesis filter 131. A set of indexes 
summarizing indexes in correspondence with respectives of the shape 
code vectors and the quantization gains with regard to the subvectors for 
Nsbv pieces of the subvectors, is outputted to the orthogonal 
transformation coefficient inverse quantizing circuit 460. 

Fig. 30 is a block diagram showing a constitution of a vocal music 
signal decoding apparatus in correspondence with the eighth 
embodiment according to a sixteenth embodiment of the invention. In 
the following, an explanation will be given of a code inputting circuit 810, 
the first linear prediction coefficient synthesis filter 132, an up-sampling 
circuit 781 and a second linear prediction synthesis filter 831 which are 
blocks different from those in Fig. 29. 

The code inputting circuit 810 converts codes in bit series inputted 
from the input terminal 30 into indexes. An index in correspondence 
with the first sound source vector is outputted to the first sound source 
generating circuit 110. An index in correspondence with the delay "d" is 
outputted to the pitch signal generating circuit 112. An index in 
correspondence with the first gain is outputted to the first gain circuit 160. 
An index in correspondence with the third gain is outputted to the third 
gain circuit 162. An index in correspondence with the first quantized 
linear prediction coefficient is outputted to the first linear prediction 
synthesis filter 132 and the second linear prediction synthesis filter 831 . 
An index in correspondence with the third quantized linear prediction 
coefficient is outputted to the second linear prediction synthesis filter 831 . 
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A set of indexes summarizing indexes in correspondence with 
respectives of the shape code vectors and the quantization gains with 
regard to the subvectors for Nsbv pieces of the subvectors, is outputted 
to orthogonal transformation coefficient inverse quantizing circuit 460. 

The first linear prediction synthesis filter 132 is provided with a 
table stored with first quantized linear prediction coefficients. The first 
linear prediction synthesis filter 132 is inputted with the fifth sound 
source vector outputted from the adder 184 and the index in 
correspondence with the first quantized linear prediction coefficient 
outputted from the code inputting circuit 810. Further, by reading the 
first quantized linear prediction coefficient in correspondence with the 
index from the table and driving the synthesis filter set with the first 
quantized linear prediction coefficient by the fifth sound source vector, 
the first reproduced vector having the first band is provided. Further, 
the first reproduced vector is outputted to the up-sampling circuit 781 . 

The up-sampling circuit 781 inputs the first reproduced vector 
outputted from the first linear prediction synthesis filter 132, upsamples 
the first reproduced vector and provides the third reproduced vector 
having the third band. Further, the third reproduced vector is outputted 
to the first adder 182. 

The second linear prediction synthesis filter 831 is provided with a 
first table stored with first quantized linear prediction coefficients having 
the first band and a second table stored with third quantized linear 
prediction coefficients having the third band. The second linear 
prediction synthesis filter 831 is inputted with the third excitation vector 
outputted from the orthogonal inverse transformation circuit 440, the first 
index in correspondence with the first quantized linear prediction 
coefficient outputted from the code inputting circuit 810 and the second 
index in correspondence with the third quantized linear prediction 
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coefficient. The second linear prediction synthesis filter 831 reads the 
first quantized linear prediction coefficient in correspondence with the 
first index from the first table, converts the first quantized linear 
prediction coefficient into LSP, further, subjects the converted first 
quantized linear prediction coefficient to sampling frequency conversion 
to thereby generate first LSP in correspondence with the sampling 
frequency of the third reproduced vector. Further, the third quantized 
linear prediction coefficient in correspondence with the second index is 
read from the second table and converted into LSP to thereby generate 
third LSP. Further, second LSP provided by adding first LSP and third 
LSP, is converted into the linear prediction coefficient to thereby 
generate the second linear prediction coefficient. The second linear 
prediction synthesis filter 831 generates the second reproduced vector 
having the third band by driving the synthesis filter set with the second 
linear prediction coefficient by the third excitation vector. Further, the 
second reproduced vector is outputted to the adder 182. 

The adder 182 receives the third reproduced vector outputted from 
the up-sampling circuit 781 and the second reproduced vector outputted 
from the second linear prediction synthesis filter 831 , calculates a sum of 
these and outputs the sum as a fourth reproducing vector via the output 
terminal 40. 

Industrial Applicability 

According to the invention, a vocal music signal can excellently be 
coded over all of bands. The reason is that a first reproduction signal is 
generated by driving a linear prediction synthesis filter calculated from 
an input signal by a sound source signal having a band characteristic in 
correspondence with a low region of the input signal, a residual signal is 
generated by driving an inverse filter of the linear prediction synthesis 
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filter by a differential signal of the input signal and the first reproduction 
signal and a high region component of the residual signal is coded by 
using a coding system based on orthogonal transformation and 
accordingly, coding performance with regard to the high region 
component of the input signal is improved. 



